Filtered Fictitious Play for Perturbed Observation Potential Games and Decentralised POMDPs
نویسندگان
چکیده
Potential games and decentralised partially observable MDPs (Dec–POMDPs) are two commonly used models of multi–agent interaction, for static optimisation and sequential decision– making settings, respectively. In this paper we introduce filtered fictitious play for solving repeated potential games in which each player’s observations of others’ actions are perturbed by random noise, and use this algorithm to construct an online learning method for solving Dec–POMDPs. Specifically, we prove that noise in observations prevents standard fictitious play from converging to Nash equilibrium in potential games, which also makes fictitious play impractical for solving Dec–POMDPs. To combat this, we derive filtered fictitious play, and provide conditions under which it converges to a Nash equilibrium in potential games with noisy observations. We then use filtered fictitious play to construct a solver for Dec–POMDPs, and demonstrate our new algorithm’s performance in a box pushing problem. Our results show that we consistently outperform the state–of–the–art Dec– POMDP solver by an average of 100% across the range of noise in the observation function.
منابع مشابه
Multi-agent learning using Fictitious Play and Extended Kalman Filter
Decentralised optimisation tasks are important components of multiagent systems. These tasks can be interpreted as n-player potential games: therefore game-theoretic learning algorithms can be used to solve decentralised optimisation tasks. Fictitious play is the canonical example of these algorithms. Nevertheless fictitious play implicitly assumes that players have stationary strategies. We pr...
متن کاملOn Best-Response Dynamics in Potential Games
This work studies the convergence properties of continuous-time fictitious play in potential games. It is shown that in almost every potential game and for almost every initial condition, fictitious play converges to a pure-strategy Nash equilibrium. We focus our study on the class of regular potential games; i.e., the set of potential games in which all Nash equilibria are regular. As byproduc...
متن کاملAdaptive Forgetting Factor Fictitious Play
It is now well known that decentralised optimisation can be formulated as a potential game, and game-theoretical learning algorithms can be used to find an optimum. One of the most common learning techniques in game theory is fictitious play. However fictitious play is founded on an implicit assumption that opponents’ strategies are stationary. We present a novel variation of fictitious play th...
متن کاملEvolution in games with randomly disturbed payoffs
We consider a simple model of stochastic evolution in population games. In our model, each agent occasionally receives opportunities to update his choice of strategy. When such an opportunity arises, the agent selects a strategy that is currently optimal, but only after his payoffs have been randomly perturbed. We prove that the resulting evolutionary process converges to approximate Nash equil...
متن کاملStochastic fictitious play with continuous action sets
Continuous action space games form a natural extension to normal form games with finite action sets. However, whilst learning dynamics in normal form games are now well studied, it is not until recently that their continuous action space counterparts have been examined. We extend stochastic fictitious play to the continuous action space framework. In normal form games the limiting behaviour of ...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره شماره
صفحات -
تاریخ انتشار 2011